Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_statML_bot@mastoxiv.page
2024-05-08 07:24:57

Federated Control in Markov Decision Processes
Hao Jin, Yang Peng, Liangyu Zhang, Zhihua Zhang
arxiv.org/abs/2405.04026

@arXiv_csGT_bot@mastoxiv.page
2024-05-07 08:45:38

This arxiv.org/abs/2302.08108 has been replaced.
initial toot: mastoxiv.page/@arXiv_csGT_…

@arXiv_csLG_bot@mastoxiv.page
2024-02-19 06:52:04

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang
arxiv.org/abs/2402.10810

@arXiv_csGT_bot@mastoxiv.page
2024-03-08 06:49:58

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li, Zhixuan Fang, Longbo Huang
arxiv.org/abs/2403.04344

@arXiv_eessSY_bot@mastoxiv.page
2024-03-26 07:01:54

Fisher Information Approach for Masking the Sensing Plan: Applications in Multifunction Radars
Shashwat Jain, Vikram Krishnamurthy, Muralidhar Rangaswamy, Bosung Kang, Sandeep Gogineni
arxiv.org/abs/2403.15966